NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Adapting Rapid Motor Adaptation for Bipedal Robots

https://doi.org/10.1109/IROS47612.2022.9981091

Kumar, Ashish; Li, Zhongyu; Zeng, Jun; Pathak, Deepak; Sreenath, Koushil; Malik, Jitendra (October 2022, Proceedings of the IEEERSJ International Conference on Intelligent Robots and Systems)

Full Text Available
Functional Regularization for Reinforcement Learning via Learned Fourier Features

Li, Alexander; Pathak, Deepak (December 2021, Advances in neural information processing systems)

We propose a simple architecture for deep reinforcement learning by embedding inputs into a learned Fourier basis and show that it improves the sample efficiency of both state-based and image-based RL. We perform an infinite-width analysis of our architecture using the Neural Tangent Kernel and theoretically show that tuning the initial variance of the Fourier basis is equivalent to functional regularization of the learned deep network. That is, these learned Fourier features allow for adjusting the degree to which networks underfit or overfit different frequencies in the training data, and hence provide a controlled mechanism to improve the stability and performance of RL optimization. Empirically, this allows us to prioritize learning low-frequency functions and speed up learning by reducing networks' susceptibility to noise in the optimization process, such as during Bellman updates. Experiments on standard state-based and image-based RL benchmarks show clear benefits of our architecture over the baselines. Code available at https://github.com/alexlioralexli/learned-fourier-features
more » « less
Full Text Available
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives

Dalal, Murtaza; Pathak, Deepak; Salakhutdinov, Ruslan (December 2021, Advances in neural information processing systems)

Despite the potential of reinforcement learning (RL) for building general-purpose robotic systems, training RL agents to solve robotics tasks still remains challenging due to the difficulty of exploration in purely continuous action spaces. Addressing this problem is an active area of research with the majority of focus on improving RL methods via better optimization or more efficient exploration. An alternate but important component to consider improving is the interface of the RL algorithm with the robot. In this work, we manually specify a library of robot action primitives (RAPS), parameterized with arguments that are learned by an RL policy. These parameterized primitives are expressive, simple to implement, enable efficient exploration and can be transferred across robots, tasks and environments. We perform a thorough empirical study across challenging tasks in three distinct domains with image input and a sparse terminal reward. We find that our simple change to the action interface substantially improves both the learning efficiency and task performance irrespective of the underlying RL algorithm, significantly outperforming prior methods which learn skills from offline expert data. Code and videos at https://mihdalal.github.io/raps/
more » « less
Full Text Available
Hierarchical Neural Dynamic Policies

https://doi.org/10.15607/rss.2021.xvii.023

Bahl, Shikhar; Gupta, Abhinav; Pathak, Deepak (July 2021, Robotics: Science and Systems)

We tackle the problem of generalization to unseen configurations for dynamic tasks in the real world while learning from high-dimensional image input. The family of nonlinear dynamical system-based methods have successfully demonstrated dynamic robot behaviors but have difficulty in generalizing to unseen configurations as well as learning from image inputs. Recent works approach this issue by using deep network policies and reparameterize actions to embed the structure of dynamical systems but still struggle in domains with diverse configurations of image goals, and hence, find it difficult to generalize. In this paper, we address this dichotomy by leveraging embedding the structure of dynamical systems in a hierarchical deep policy learning framework, called Hierarchical Neural Dynamical Policies (H-NDPs). Instead of fitting deep dynamical systems to diverse data directly, H-NDPs form a curriculum by learning local dynamical system-based policies on small regions in state-space and then distill them into a global dynamical system-based policy that operates only from high-dimensional images. H-NDPs additionally provide smooth trajectories, a strong safety benefit in the real world. We perform extensive experiments on dynamic tasks both in the real world (digit writing, scooping, and pouring) and simulation (catching, throwing, picking). We show that H-NDPs are easily integrated with both imitation as well as reinforcement learning setups and achieve state-of-the-art results.
more » « less
Full Text Available
Unsupervised Learning of Visual 3D Keypoints for Control

Chen, Boyuan; Abbeel, Pieter; Pathak, Deepak (July 2021, International Conference on Machine Learning)
null (Ed.)
Learning sensorimotor control policies from highdimensional images crucially relies on the quality of the underlying visual representations. Prior works show that structured latent space such as visual keypoints often outperforms unstructured representations for robotic control. However, most of these representations, whether structured or unstructured are learned in a 2D space even though the control tasks are usually performed in a 3D environment. In this work, we propose a framework to learn such a 3D geometric structure directly from images in an end-toend unsupervised manner. The input images are embedded into latent 3D keypoints via a differentiable encoder which is trained to optimize both a multi-view consistency loss and downstream task objective. These discovered 3D keypoints tend to meaningfully capture robot joints as well as object movements in a consistent manner across both time and 3D space. The proposed approach outperforms prior state-of-art methods across a variety of reinforcement learning benchmarks. Code and videos at https://buoyancy99.github. io/unsup-3d-keypoints/.
more » « less
Full Text Available
Unsupervised Learning of Visual 3D Keypoints for Control

Chen, Boyuan; Abbeel, Pieter; Pathak, Deepak (January 2021, International Conference on Machine Learning)

Learning sensorimotor control policies from high-dimensional images crucially relies on the quality of the underlying visual representations. Prior works show that structured latent space such as visual keypoints often outperforms unstructured representations for robotic control. However, most of these representations, whether structured or unstructured are learned in a 2D space even though the control tasks are usually performed in a 3D environment. In this work, we propose a framework to learn such a 3D geometric structure directly from images in an end-to-end unsupervised manner. The input images are embedded into latent 3D keypoints via a differentiable encoder which is trained to optimize both a multi-view consistency loss and downstream task objective. These discovered 3D keypoints tend to meaningfully capture robot joints as well as object movements in a consistent manner across both time and 3D space. The proposed approach outperforms prior state-of-art methods across a variety of reinforcement learning benchmarks. Code and videos at https://buoyancy99.github.io/unsup-3d-keypoints/
more » « less
Full Text Available
RB2: Robotic Manipulation Benchmarking with a Twist

Dasari, Sudeep; Wang, Jianren; Hong, Joyce; Bahl, Shikhar; Lin, Yixin; Wang, Austin S; Thankaraj, Abitha; Singh Chahal, Karanbir; Calli, Berk; Gupta, Saurabh; et al (January 2021, NeurIPS 2021 Datasets and Benchmarks Track)

Full Text Available
Toward Multimodal Image-to-Image Translation

Zhu, Jun-Yan; Zhang, Richard; Pathak, Deepak; Darrell, Trevor; Efros, Alexei; Wang, Oliver; Shechtman, Eli (January 2017, Advances in neural information processing systems)

Full Text Available

Search for: All records